# REDUCED COMPLEXITY VIDEO DECODING AT FULL RESOLUTION USING VIDEO EMBEDDED RESIZING

## Background of the Invention

The present invention relates generally to video compression, and more particularly, to decoding where embedded resizing is used in conjunction with external scaling in order to reduce the computational complexity of the decoding.

Video compression incorporating a discrete cosine transform (DCT) is a technology that has been adopted in multiple international standards such as MPEG-1, MPEG-2, MPEG-4, and H.262. Among these schemes, MPEG-2 is the most widely used, in DVD, satellite DTV broadcast, and the U.S. ATSC standard for digital television.

An example of a MPEG video decoder is shown in Figure 1. The MPEG video decoder is a significant part of MPEG-based consumer video products. In such products, a desirable goal is to minimize the complexity of the decoder while maintaining the video quality.

## Summary of the Invention

The present invention is directed to decoding a video bitstream at a first resolution where embedded resizing is used in conjunction with external scaling in order to reduce the computational complexity of the decoding. According to the present

invention, residual error frames are produced at a second lower resolution. Motion compensated frames are produced also at the second lower resolution. The residual error frames are then combined with the motion compensated frames to produce video frames. Further, the video frames are up-scaled to the first resolution.

According to the present invention, the up-scaling may be performed by a technique selected from a group consisting of repeating pixel values and linear interpolation. Further, the upscaling is performed in a same direction as down scaling in the residual error frames. In one example of the present invention, the up-scaling is performed in a horizontal direction.

## Brief Description of the Drawings

Referring now to the drawings were like reference numbers represent corresponding parts throughout:

Figure 1 is a block diagram of a MPEG decoder;

Figure 2 is a block diagram of one example of a decoder according to the present invention;

Figure 3 is a block diagram of another example of a decoder according to the present invention; and

Figure 4 is a block diagram of one example of a system

\\SERVERO\\SYS2\\WPDOCS\\GR\\us010341-spec.doc according to the present invention.

## Detailed Description

The present invention is directed to decoding where embedded resizing is used in conjunction with external scaling in order to reduce the computational complexity of the decoding. According to the present, a video bitstream is decoded with a reduced output resolution using embedded resizing. The output video is then up scaled to the display resolution using external scaling. Since the embedded resizing may enable both the inverse discrete transform (IDCT) and motion compensation (MC) to be performed at a lower resolution, the overall computational complexity of the decoding is reduced.

One example of a decoder according to the present invention is shown in Figure 2. As can be seen, the decoder includes a first path made up of the variable length decoder (VLD) 2, inverse scan and inverse quantization (ISIQ)/filtering block 14, 8X8 IDCT 16 and decimation block 18.

During operation, the VLD 2 will decode the incoming video bitstream to produce motion vectors (MV) and DCT coefficients. The ISIQ/filtering block 14 then inverse scans and inverse quantizes the DCT coefficients received from the VLD 2. In MPEG-2, inverse

\\SERVERO\\SYS2\\WPDOCS\\GR\\us010341-spec.doc zig-zag scanning is performed. Further, the IDCT/filtering block 14 also performs filtering to eliminate high frequencies from the DCT coefficients.

In this embodiment, the 8x8 IDCT 16 performs an inverse discrete transform in 8X8 blocks to produce blocks of pixel values. After performing the IDCT, the decimation block 18 then samples the output of the 8X8 IDCT 16 at a predetermined rate in order to reduce the resolution of the video frames being decoded. According to the present invention, the decimation block 18 may sample the pixel values in the horizontal direction, vertical direction or both.

Further, the sampling rate of the decimation block 18 is chosen according to the desired level of internal scaling. In this embodiment, the sampling rate is "2" to provide an output resolution of "2" since a 4 pixel MC unit is being utilized. However, according to the present invention, other sampling rates may be chosen to provide a different resolution such as "4" or "1/8". At the output of the decimation block 18, decoded I-frames and residual error frames are produced at a reduced resolution. As can be seen, these frames are provided at one side of an adder 8.

As can be further seen, the decoder also includes a second path made up of the VLD 2, a down scaler 20, a  $\frac{1}{4}$  pixel MC 22 unit

\\SERVERO\SYS2\WPDOCS\GR\us010341-spec.doc

and a frame store 12. During operation, the down scaler 20 reduces the magnitude of the MVs provided by the VLD 2 proportional to the reduction in the first path. This will enable the motion compensation to be performed at a reduced resolution to match the frames produced in the first path. In this embodiment, the MVs are scaled down by a factor of "2" to match the sampling rate of the decimation unit 18.

The ¼ pixel MC unit 22 then performs motion compensation on pervious frames stored in the frame 12 store according to the scaled down MVs. In this embodiment, since the MVs have been scaled down by a factor of "2", the motion compensation will be performed at a "1/4" resolution. At the output of the ¼ pixel MC unit 22, motion compensated frames at a reduced resolution are produced. As can be seen, these frames are provided to the other side of the adder 8.

During operation, the adder 8 combines the frames from the first and second paths to produce video frames at a reduced resolution. As can be seen, the video frames from the adder 8 are then provided to an external up-scaler 24. The up-scaler 24 is external since it is placed outside the decoding loop. The up-scaler 8 increases the resolution of the video frames to the full display resolution. The increase in resolution is proportional to

the decrease that occurred internal to the decoding loop. In this embodiment, the up-scaler 24 will increase the resolution of the video frames by a factor of "2".

Further, the up-scaler 24 may also increase the resolution in the horizontal direction, vertical direction or both depending on the scaling done internally. For example, if the original resolution of the bitstream was "720X480" and it was reduced to "360X480" by the internal scaling, the up-scaler 24 would perform horizontal scaling from "360x480" to "720X480".

Another example of a decoder according to the present invention is shown in Figure 3. The decoder of Figure 3 is the same as Figure 2 except for the first path. As can be seen, in this example, the first path includes a VLD 2, an ISIQ/filtering/scaling block 40 and a 4X4 IDCT block 26. Therefore, in this example, the IDCT is performed at the reduced resolution which further reduces the overall computational complexity of the decoding.

During operation, the ISIQ/filtering/scaling block 40 inverse scans and inverse quantizes the DCT coefficients received from the VLD 2. The IDCT/filtering/scaling block 40 also performs filtering to eliminate high frequencies from the DCT coefficients. However, in this example, IDCT/filtering/scaling block 40 also performs

\\SERVERO\SYS2\WPDOCS\GR\us010341-spec.doc

scaling on the DCT coefficients received from the VLD 2. In this example, the IDCT/filtering/scaling block 40 will down scale 8X8 DCT blocks received from the VLD 2 to 4X4 blocks.

The 4X4 IDCT 26 then performs an inverse discrete transform in 4X4 blocks to produce blocks of pixel values. The output of the 4X4 IDCT 26 is then provided to one input of the adder 8

As in the previous example, the adder 8 combines the frames from the first and second paths to produce video frames at a reduced resolution. As previously described, decoded I-frames and residual error frames are produced by the first path 2,40,26, while motion compensated frames are produced by the second path 12,20,22. The up-scaler 24 then increases the resolution of the video frames to the full display resolution. In this example, the up-scaler also increases the resolution by a factor of "2" in both the horizontal and vertical direction.

According to the present invention, the decoders of Figures 2-3 may be implemented in hardware, software or a combination of both. In a software implementation, it is preferred that the upscaler 24 utilize a simple up-scaling technique such as just repeating pixel values or using a linear interpolation. In other embodiments, the up-scaler 24 may be implemented in hardware and thus a more complex technique may be used. For example, in the

\\SERVERO\SYS2\WPDOCS\GR\us010341-spec.doc

PHILIPS TRIMEDIA chip, a dedicated coprocessor is included for performing scaling. This coprocessor uses a programmable five-tap filter arrangement where additional pixel values are calculated based on a weighted average of five pixels. Therefore, the upscaler 24 may be implemented using this dedicated processor while the rest of the decoder may be implemented in software and run on the CPU core of the PHILIPS TRIMEDIA processor.

One example of a system in which the decoding utilizing embedded resizing in conjunction with external scaling may be implemented is shown in Figure 4. By way of example, the system may represent a television, a set-top box, a desktop, laptop or palmtop computer, a personal digital assistant (PDA), a video/image storage device such as a video cassette recorder (VCR), a digital video recorder (DVR), a TiVO device, etc., as well as portions or combinations of these and other devices. The system includes one or more video sources 28, one or more input/output devices 36, a processor 30, a memory 32 and a display device 38.

The video/image source(s) 28 may represent, e.g., a television receiver, a VCR or other video/image storage device. The source(s) 28 may alternatively represent one or more network connections for receiving video from a server or servers over, e.g., a global computer communications network such as the Internet, a wide area

NSERVERO\SYS2\WPDOCS\GR\us010341-spec.doc
network, a metropolitan area network, a local area network, a
terrestrial broadcast system, a cable network, a satellite network,
a wireless network, or a telephone network, as well as portions or
combinations of these and other types of networks.

The input/output devices 36, processor 30 and memory 32 communicate over a communication medium 34. The communication medium 34 may represent, e.g., a bus, a communication network, one or more internal connections of a circuit, circuit card or other device, as well as portions and combinations of these and other communication media. Input video data from the source(s) 28 is processed in accordance with one or more software programs stored in memory 32 and executed by processor 30 in order to generate output video/images supplied to the display device 38.

In one embodiment, the decoding utilizing embedded resizing in conjunction with external scaling is implemented by computer readable code executed by the system. The code may be stored in the memory 32 or read/downloaded from a memory medium such as a CD-ROM or floppy disk. In other embodiments, hardware circuitry may be used in place of, or in combination with, software instructions to implement the invention.

While the present invention has been described above in terms of specific examples, it is to be understood that the invention is

Noterous Not